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Abstract. Let M n be an n x n real (resp. complex) Wigner matrix and f7 n A n £/^ 
be its spectral decomposition. Set (yi, 2/2 ■ ■ • , yn) T = U^x, where x = (xi,X2, • • • , 
x„) T is a real (resp. complex) unit vector. Under the assumption that the ele- 
ments of M n have 4 matching moments with those of GOE (resp. GUE), we show 

that the process X n (t) = \J 4p ^}-^ (|j/»| 2 — conver g es weakly to the Brownian 
bridge for any x such that Hx^ — s* as n — > oo, where /3 = 1 for the real case 
and P = 2 for the complex case. Such a result indicates that the othorgonal (resp. 
unitary) matrices with columns being the eigenvectors of Wigner matrices are 
asymptotically Haar distributed on the orthorgonal (resp. unitary) group from a 
certain perspective. 

1. Introduction 

Let M n = ^(%) n n be an n x n real or complex Wigner matrix whose definition 
is stated below. 

Definition 1.1 (Real Wigner matrix). We call M n a real Wigner matrix if it is 
a symmetric random matrix such that {vij,l < i < j < n} is a collection of in- 
dependent real random variables. And Vij,l < i < j < n are i.i.d with common 
mean and variance 1. And vu, 1 < i < n are i.i.d with common mean and finite 
variance. 

Definition 1.2 (Complex Wigner matrix). We call M n a complex Wigner matrix 
if it is an Hermitian random matrix such that {vij, l<i<j<n}isa collection 
of independent random variables. And Vij,l < i < j < n are i.i.d complex random 
vaiables with common mean and variance 1. And va,l < i < n are i.i.d real 
random variables with common mean and finite variance. 

Example 1.1 (GOE and GUE). A real Wigner matrix with N(Q, 1) off-diagonal 
elements and N(0, 2) diagonal elements is called GOE. And a complex Wigner ma- 
trix with N(0, 1/2) + y/—lN(0, 1/2) above-diagonal elements and iV(0, 1) diagonal 
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elements is called GUE. Here iV(0, 1/2) + \/— 1N(0, 1/2) stands for the standard 
complex normal variable whose real and imaginary parts are independent. 

Now we denote the ordered eigenvalues of M n by Ai < • • • < X n and the corre- 
sponding normalized eigenvectors by ui,U2, • • • , u n . Set A n = diag(Ai, • • • , X n ) and 
write the spectral decomposition of M n as 

M n = U n A n U*, 

where 

U n = (ui,u 2 , • • • ,u„). 

Conventionally, we require the coefficients of the eigenvectors to be real in the real 
case. However, the choices of the normalized eigenvectors are not unique owing to 
the following two reasons. 

(rl): If there is an i G {1, 2, • • • , n} such that Aj is not simple, one can arbitrarily 
choose an orthogonal basis of the eigenspace corresponding to Aj. 

(r2):/f every eigenvalue is simple, we can rotate the eigenvector by multiplying a 
sign —1 in the real case or any phase e^ r ^ d {6 G R) in the complex case. 

Note that, when the matrix elements are continuously distributed, (rl) will cause 
no ambiguity since the eigenvalues are simple with probability one in this case. How- 
ever, even for the discontinuous case, it will be shown that whichever eigenvectors 
for the multiple eigenvalues have been taken will not affect the results of this paper 
(see the discussion at the end of Section 2 ). And actually, (r2) will also cause no 
trouble since the concerned quantities in this paper only depend on the projections 

UjU*, i = 1, • • ■ ,n 

which are uniquely defined as long as the eigenvalues are simple. However, for elim- 
inating the ambiguities in some issues we will adopt the viewpoint (iv) in [16J to fix 
the definition of the eigenvector Uj when Ai is simple. We rewrite it as follows. 

Viewpoint (iv) in |16j : In the complex case, one can replace Uj by e'^^'u^, 
where 9i is uniformly distributed on [0, 2ir). Moreover, 9i,i = 1,- • • ,n are i.i.d and 
are independent of the matrix M n . In the real case, one can replace Uj by 6jUj, where 
bi,i = 1, • • • , n are i.i.d ±1 Bernoulli variables which are independent of M n . 

Under the above viewpoint, it is well known that when M n is GOE (resp. GUE), 
U n is Haar distributed on the orthogonal group 0(n) (resp. unitary group U(n)). 
Then it is natural to conjecture whether U n of the general Wigner matrices are 
"asymptotically" Haar distributed in some sense. In other words, we care about 
the universal properties of the matrices of eigenvectors. In the past decades, a 
vast of work had been devoted to the study of the universality problems of various 
statistics of the eigenvalues. By contrast, the work on the universality of eigenvectors 
is much less. The most recent progresses on this aspect maybe the derealization or 
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localization property of the eigenvectors for different types of random matrices (see 
[S]) [Z]) 0' 15] arid [12]) and the universality for the local statistics of the eigenvector 
coefficients (see [TU], [IB]). 

In this paper, we will discuss a universality result for a global property of the 
eigenvectors. Below we give the definition of the concerned quantity of our paper 
and then explain why it is closely related to the universality of the distribution of 
U n . 

Let x = (xi, • • • ,x n ) T be a definite unit vector. That is to say, x E S 11 ' 1 in the 
real case, and x E S 2n ~ l in the complex case. Here 

S n ~ l ■= {iGl": |x| = 1}, S 271 - 1 := {z E C n : \z\ = 1}. 

Now we set the vector 

y = (yir-- ,Vn) T = U*x. 
Then we can construct a process X n {t) E D[0, 1] from the vector y as 




x n (t) = \ ^y {\vi\--i (i-i) 



where f3 = 1 is for the real case and f3 = 2 is for the complex case. Hereafter, the 
notation [^J stands for the largest integer no larger than x. In this paper, we will 
discuss the limit of the process in the weak sense. Such a problem was raised 
by Silverstein in [13] and was shown to be closely related to the universality problem 
on U n . 

Note that when U n is Haar distributed on the orthogonal group 0(n) (resp. uni- 
tary group U(n)), it is well known that for any real (resp. complex) unit vector x 
one has that y is uniformly distributed on S" 1-1 (resp. S 2 ™ -1 ). Then for the real 
case, one has 

d gR 

y = Ti — m' 

IIskII 

where := (<?i, • • ,g n ) T is a Gaussian vector with i.i.d. iV(0, 1) coefficients. 
Similarly, for the complex case one has 

d gC 

y = ti — m> 

llgcll 

where gc := (?7i+\/— lClj " ' > ^n+V - lCn) T is a Gaussian vector with i.i.d. N(0, 1/2)+ 
\f— TiV(0, 1/2) coefficients. Note that actually we can choose the Gaussian variables 
with arbitrary common variance because of the scaling invariance. Here we just 
specify them to be standard for convenience. Then by using the classical results on 
weak convergence, it is elementary to see for the Gaussian case 

X n (t)=Uw°(iL), t€[0,l], (1.2) 

where W°(t) is the standard Brownian bridge. 

Conversely, (|1.2p reflects the fact that y is uniformly distributed on sphere in a 
global sense. Thus if (|1.2|) is valid for general Wigner matrices with a large class of 
x, we can regard that U n of general case is "asymptotically" Haar distributed from 
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such a certain perspective. Such a "measure" of closeness between the distribution 
of U n and the Haar distribution was first raised by Silverstein in [33] for the sample 
covariance matrices. However, Silverstein only succeeded in proving the result for 
the unit vectors x = (^l/^/n, • • • , ±l/y/n) T under the assumption that the matrix 
elements are symmetrically distributed. As discussed above, in the Gaussian case, 
x can be arbitrary. Thus to verify (jl.2p for more general x rather than those in [13] 
is crucial. In this paper, we will prove (jl.2p for a large class of Wigner matrices 
under the restriction of II 

■^-||oo — ^ which will be shown to be necessary for the 
concerned problem in the general distribution cases (see Remark 13.51 below). Here 
| |x| | oo = maxi<j< n \xi\ is the maximum norm of x. Moreover, we do not need the 
symmetrical distribution condition imposed on the matrix elements. 
To state our main result, we need an ad hoc terminology. 

Definition 1.3 (Matching to the k-th moments). We say that two Wigner matrices 
M n = l/y/n(vij) and M n = l/^/n(vij) match to the k-th moments if 

E(5R(t; ii )'9( % -) m ) = E(3tf(^)^(%0 m )> 0<l,m<l + m<k. 

In the sequel, we will specify k = 4. That means we require the elements of two 
concerned Wigner matrices have the same first four moments. Moreover, throughout 
the paper, we will need the following additional condition on the matrix elements. 

Condition 1.4. We assume the matrix elements Vij 's have uniform subexponential 
dacay. That is, 

P(|%| >t)< C" 1 exp(-t c ) 

with some positive constant C independent ofi,j. 

Now we can state our main result. 

Theorem 1.5. Assume that M n is a real (resp. complex) Wigner matrix matches 
GOE (resp. GUE) to the A-th moments. Moreover, we assume that M n satisfies 
Condition ] 1.4\ For any definite real (resp. complex) unit vector x such that ||x||oq — ^ 
as n tends to infinity, we have 

rw LniJ i 

Xn(t) = f-^Y,^\ 2 --)^ W °^- 

i=l 

Here j3 = 1,2 in the real case and complex case respectively. 

Hereafter, when we refer to "limit" and "accumulation point" of a random se- 
quence, they are always in the sense of weak convergence. Moreover, for simplicity, 
when there is no confusion, we may omit the time parameter t from X n (t) and 
W°(t). 

The main proof strategy will benefit from the discussions in [13]. Specifically, a 
criteria for weak convergence for a random sequence on D[0, 1] with its limit sup- 
ported on C[0, 1] was provided in [T3] (see Theorem 3.1 of P2]). Such a criteria can be 
regarded as a slight modification of the classical 11 finite dimensional convergence+ 
tightness" issue. The discussions in [T3] and the recent result of Bai and Pan [T] can 
help us to confirm that the unique possible C[0, l]-supported accumulation point of 
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(X n ) n >i is W°. Then it remains to show that (X n ) n >i is tight and can only has 
C[0, l]-supported accumulation point. However, it has been shown in [p3] that the 
proof of the tightness of the sequence (X n ) n >i is an obstacle to this problem. In 
order to show the tightness, Silverstein imposed the additional symmetrical distribu- 
tions condition on the matrix elements and restricted the discussions on the special 
cases of x = (±l/y/n, ■ ■ ■ ,±l^/n) T in [13] for the sample covariance matrices. Ac- 
tually, Silverstein's proof can be adopted after slight modifications for the Wigner 
matrices under similar restrictions as those imposed in |13j for the sample covariance 
matrices. 

To remove the restrictions mentioned above, we will use a totally different method 
which is based on the so-called isotropic local semicircle law proposed by Knowles 
and Yin in [11] quite recently. Crudely speaking, We can verify the tightness of 
(X n ) n >i through providing some good upper bounds on the fourth moments of the 
increments of X n (t). Such bounds will turn out to be easily obtained for the Gaussian 
case owing to the explicit distribution information of y. For more general Wigner 
matrices, we will use the idea of comparing the general case with the Gaussian case. 
Such a comparison method relies on the classical Linderberg strategy to replace the 
matrix elements by those of the "reference" matrix one pair (or one unit in the 
diagonal case) each time and then evaluate the change of the concerned quantity 
induced by the replacement on each step. Then by a telescoping argument we can get 
the difference of the concerned quantities of two Wigner matrices. Such an approach 
was comprehensively used in the literature of the Random Matrix Theory recently. 
One can see [14j . [8] and [9] for instances. Particularly, one can refer to[10]. |16] and 
[2] for the applications of such a strategy on some problems about the eigenvectors 
of the Wigner matrices. 

More precisely, we will mainly pursue the idea of the Green function compari- 
son approach raised by Erdos, Yau and Yin in [3J. To this end, at first, we will 
approximate the increment of the process by a quantity expressed in terms of the 
Green function. Then we will perform a replacement issue on the Green functions to 
achieve the purpose of comparison. Such a strategy will heavily rely on the isotropic 
local semicircle law provided in |llj . 

Our paper will be organized as follows. In Section 2, we will present some nec- 
essary preliminaries. And in section 3, we will provide a criteria of the weak con- 
vergence of X n (t) which contains two statements. It will be shown that the first 
statement can be implied by a recent result of Bai and Pan [lj, thus we will just 
sketch the proof of this statement at the end of Section 3. The second statement 
is mainly about the tightness of the sequence (X n ) n >i, which will be handled in 
Section 4. 

Throughout the paper, the notations C, C\, C and K will be used to denote some 
n-independent positive constants whose values may defer from line to line. The 
notation || • \ \ op stands for the operator norm of a matrix. 

We will say an event E occurs with overwhelming probability if and only if 

P(E) > 1 - n~ K 
for any given positive number K when n is sufficiently large. 
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2. Preliminaries 

In this section, we will state some basic notions and recent results, especially some 
known results on the Green functions which will be frequently used in the proof of 
Theorem 11.51 

The so-called empirical spectral distribution (ESD) of M n is defined by 



1 n 



n 

i=l 

It is well known that F n {x) almost surely converges weakly to Wigner's semicircle 
law F sc (x) whose density function is given by 

Psc{x) = ^2^4- x 2 l {N < 2 }. (2.1) 

The Stieltjes transform of a probability measure fi can be defined for all complex 
number z = E + y 7 — It/ E C \ K as 

m^iz) = I — - — /j,(dx). 
J x- z 

Here E and ij are the real and imaginary parts of z respectively. Thus by definitions, 
we have 



m Fn (z) = - V^— = -tr(M n -zI)-\ (2.2) 
n t—' Aj — z n 

i=i 



and 



2 1 



m sc (z) = / p sc (x)dx. 

J-2 x - z 

For simplicity of the notation, we will briefly write m Fn {z) and m Fsc (z) as m n (z) 
and m sc (z) respectively. It is well known that 

sup \m sc (z)\ = O(l). 



zed 

The Green function G n (z) of M n is defined by 

G n {z) := (Gy(z)) n , n = (M n -zln)- 1 
which is also called the resolvent of M n . Now by (|2.2p . we also have 

m n (z) = -trG n (z) = - Gu{z). 



It was shown by Erdos, Yau and Yin in |9j that when 

z £ S := {E + V^It] : \E\ < 5, n _1 (log n) cloglogn < r? < 10} (2.3) 

for some positive constant C, one has that m n (z) is well approximated by m sc (z) 
with overwhelming probability. Moreover, it was proved in [S] that Gii(z)'s are close 
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to m sc {z) and Gjk(z)'s (j 7^ k) are small in the sense that for some positive constant 
C, 



max\G H (z)-m 9C (z)\+max\G jk (z)\ < (lognf lo ^ n I ^ m sc(z) + J_\ 

holds uniformly for z G S with overwhelming probability. See Theorem 2.1 of [9j for 
details. 

Now if we denote the standard basis of W 1 by ei,e2, • • • , e n conventionally i.e. 
ej is the nxl vector with only the i-th component being 1 and the others being 0. 
Then we can write 

Gij(z) = e*G n (z)ej. 
Recently Knowles and Yin generalized the estimation (|2.4p to the quantities 

Gvw := v*G n (z)w 

for any definite unit vectors v, w in [11] and provided the so-called isotropic local 
semicircle law. Meanwhile, for any unit vector v they also provided in [11] the 
uniform upper bounds for the quantities 

|(ui,v)|,i = 1, • • • ,n. 

And they named the control on the quantities above as the isotropic delocalization 
of the eigenvectors, which can be viewed as a generalization of the delocalization 
property for eigenvectors raised in [6]. Both the isotropic local semicircle law and 
isotropic delocalization property will be crucial to our analysis in the sequel. We 
remark here the assumptions imposed in [11] are weaker than those made in our 
paper. We refer to [11] for details and will not mention this fact again in the sequel. 
For convenience, we will reformulate their results as the following lemma under our 
assumptions. 

Lemma 2.1 (Knowles and Yin, [llj). Under the assumptions of Theorem 1 1.51 we 
have the following two statements. 

(1): (isotropic local semicircle law). For z £ S, there exists some positive constant 
C such that 



\(v,G n (z)w) - m sc (z){ 



v,w)| < (logn) Clo ^n ( I^M + A.) (2.5) 

\ y nrj nr\ J 

with overwhelming probability for all deterministic and normalized vectors v, w € 
C n . 

(2): (isotropic delocalization). For any deterministic and normalized vector v G 
C n , we have 

o flog ri) clo 8 l °Sn 
supKu^v)! 2 ^^^ (2.6) 

i n 

for some positive constant C with overwhelming probability. 
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Remark 2.2. We remind here that the validity of Ii2.6)) does not depend on the 
ambiguity of the choices of the eigenvectors caused by (rl ) and (r2 ). For details, 
see the proof of Theorem 2. 5 of [IT] . 



Remark 2.3. Actually, we will need \2. 5)) to hold uniformly for all z £ S with 
overwhelming probability in some discussions below. Note that 

||G'(*)|U<7T 2 . (2.7) 
Now we choose an e-net of S with e = n~ K with sufficiently large K. Then we have 



\2. 5\) holds uniformly on the e-net with overwhelming probability. By using (2.1) 
and the elementary mean value theorem, we can get that \2. 5\) uniformly holds on S 
with overwhelming probability by slightly adjusting the constant C in 112. 5\) . 



At the end of this section we explain that the ambiguity caused by (rl) mentioned 
in Introduction does not influence the limit property of X n (t). Now let ji := 7j >n G 
[—2, 2] be the classical location of Aj in the sense that 

7i i 

p sc (x)dx = -. 
-2 n 

It is easy to check that for some positive constant C, 

hi - 7i+i| < C[mm(i,n - i + l)]- 1/3 n _2/3 , i = 1, • • ■ , n - 1. (2.8) 

By the rigidity property of the eigenvalues which was proved by Erdos, Yau and Yin 
in [5] we see that with overwhelming probability, the event 

n 

< (log^^^^^Imin^^-i + l)]- 1 /^- 2 / 3 } (2.9) 

i=i 

holds for some positive constant C\ when n is sufficiently large. Now we assume 
that there is an no such that 

A no < ^no+l = • • • = A no +fc < A n() +fe+i. 

Note that although the eigenvectors u no+ i, • • • , u no+ i t can be chosen in many differ- 
ent ways, the choice of the projection matrix 

( u no + l)"' - i u n +fc)( u n + l; " " " > u n +fc)* 

is unique. Thus the quantity 

no+fc j , 

{\Ui\ 2 - — ) = X*(u n() + i, •• • , U no+fc )(u nQ+ i, • • • ,U no+ fc)*X— — 

i=n +l 

is uniquely defined. This shows that the definition of X n (t) does not depend on the 
choices of the eigenvectors as long as A^ nt j is a simple eigenvalue. Now if A^j is 
not simple, we can assume that uq + 1 < [nt\ < uq + k without loss of generality. 
Following from (|2.8|) and (|2.9p , it is not difficult to see with overwhelming probability, 
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there is no eigenvalue with multiplicity larger than (log n) 2C<1 loglogr \ For tiq + 1 < 
[nt\ < no + k, we can write 

X n (t)=X n (n /n) + JB1 V (|^| 2 --)- (2-10) 
V 2 ^-^ n 

i=no+i 

Then by the fact k < (log n) 2C<1 loglog?1 with overwhelming probability and the 
isotropic derealization property (|2.6p we see that the second term on the right 
hand side of (|2.10|) (not well defined term) can be discarded in probability. More- 
over, since both the upper bound of the multiplicity of eigenvalue and isotropic 
derealization property hold uniformly in i = 1, ••■ ,n, the above discussion also 
holds uniformly in t £ [0, 1]. Thus, the limit behaviour of X n (t) does not depend on 
the ambiguity caused by (rl). 

3. Uniqueness of the C[0, 1]-supported accumulation point 

In this section, we will provide some known results and mainly show that W° is 
the unique C[0, l]-supported accumulation point of (X n ) n >x. Similar to the discus- 
sions in [2], to prove Theorem 11.51 it suffices to verify the following two lemmas. 



Lemma 3.1. Under the assumptions of Theorem \1.5\ the sequence (X n ) n >i has W° 
as its unique possible accumulation point supported on C[0, 1]. 

Lemma 3.2. Under the assumptions of Theorem \1.5L the sequence (X n ) n >i is tight 
and can only have C [0,1] -supported accumulation points. 

The remaining part of this section will be devoted to the proof of Lemma [3.11 To 
this end , we will need the Theorem 1.2 of pQ. For convenience of the reader, we 
rewrite it here. Let 

W n (g) = v / ^(x* 5 (M n )x - -trg(M n )), 

n 

where g(x) is a function analytic on a region in the complex plane covering the 
interval [—2, 2]. Then we have the following theorem provided by Bai and Pan in [1] 

Theorem 3.3 (Bai and Pan, [I]). Let M n be a real or complex Wigner matrix 
satisfying E|fi2| 4 < oo. Suppose that gi,-- - ,gu are analytic on an open interval 
including [—2,2], and that 

||x||oo -> 0. 

(1) If M n is real, i.e. (3 = 1, and Euf 2 = 0, then W n {g%), ■ ■ ■ ,W n {gk) converges 
weakly to a Gaussian vector Wt with mean zero and covariance function 

Cov(W gi ,W g2 ) = 2n J gi (x)g 2 (x)dF sc (x) - J gi (x)dF sc (x) J g 2 (x)dF sc (x)\ (3.1) 

(2) If M n is complex, i.e. j3 = 2, and Eu 2 2 = and E,vf 2 vi2 = 0, then (1) remains 
true. 
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Remark 3.4. We remind here that W n (g) in the complex case is different from 
X n {g) in pQ in scaling. 

Remark 3.5. It has been shown that the condition HxH^ — > is necessary for 
Theorem \3.3[ See Remark 1.4 of[l\. 



Now we begin to prove Lemma 13.11 

Proof of Lemma Um Changing the variable t by F sc (u), one can see that it is equiv- 
alent to verify that the sequence {X n (F sc (u))) n >\ has W°{F sc (u)) as its unique 
possible accumulation point supported on C[— 2, 2]. We claim that it suffices to 
show the following two statements. 

(a) : We have 

2 ^ 00 f r-2 

u r X n (F sc (u))du \ ^\ u r W°{F sc (u))du 

(b) : The distribution of a process X(u) supported on C[— 2,2] is uniquely deter- 
mined by the distribution of 

2 

u r X(u)du 

' r=0 

Below we sketch the proof of Lemma |3 . 1 1 providing (a) and (b) at first. Note that 
if we assume that one convergent subsequence {X n / (F sc (u))} converges weakly to 
some C[— 2, 2]-supported process X(u), then by Theorem 5.1 of [3], one has 

u r X n ,{F sc (u))du \ { u r X(u)du\ 

-2 J r=0 U-2 J r=0 

Meanwhile, by (b) we also know that if X(u) is C[— 2, 2]-supported, its distribution 
is uniquely determined by the distribution of 

2 

u r X(u)du ! 

-2 J r=0 

Thus we have {X n '(F sc (-u))} converges weakly to W°(F sc (u)) as n' — > 00. Therefore, 
we have Lemma l3.1l bv (a) and (b). It remains to verify (a) and (b). The proof of 
(b) is nearly the same as the counterpart of the proof for Theorem 3.1 of [13]. Thus 
here we omit the detail. 

To verify (a) for X n (F sc (u)), we will work on the slight modification X n (F n (u)) 
instead. Note that by the rigidity property which was proved by Erdos, Yau and 
Yin in [9] we see that there exists some positive constant C such that 

sup \F n (u) - F sc (u)\ < (3.2) 

\u\<5 n 
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with overwhelming probability (see Theorem 2.2 of [9j). Thus we have for some 
positive constant C 

1 Hop- r?"l C " loglogn 

sup \X n (F sc (u)) - X n (F n {u))\ < ^(logn) Closlogn max(|^| 2 + -) < 1 S >-= 

|tt|<5 1 n V n 

with overwhelming probability. Above we have used the isotropic delocalization 
property (|2.6p . Therefore, it suffices to study the limit behaviour of 

\ oo 

u r X n (F n {u))du \ . 
. M<5 J r=0 

Moreover, also by the rigidity property provided in [9], we see that all the eigenvalues 
of M n are in the interval [—5, 5] with overwhelming probability. Combining with the 
fact that 

X n (0) = X n (l) = 0, 

we have 

1 °° ( f' + °° ~) °° 

u r X n (F n (u))du \ =\ u r X n (F n {u))du \ 

|u|<5 J r=0 U-oo J r= o 

with overwhelming probability. Relying on the discussion above one can transfer 
the problem to show that 

u r X n {F n {u))du \ ^\ u r W°{F sc {u))du \ . 

By integration by parts, it suffices to verify 

u r dX n (F n (u)) =>\ u r dW°(F sc (u)) . (3.3) 



r=0 <J-2 J r =0 



Note that 



y°° u r dX n (F n (u)) = ^ L*Mfr-hrM*\ 



Thus we arrive at the stage to use Theorem l3.3l Note that Theorem 13 . 3 1 only depends 
on the first three moments of the matrix elements. And the GOE and GUE obviously 
satisfy the moment assumptions in Theorem l3.3l Moreover, by the discussions above 
and (|1.2p for the Gaussian case, ()3.3p is valid for GOE and GUE obviously. Hence, 
(|3.3|) also holds for general Wigner matrices under the assumptions of Theorem 11.51 
So we complete the proof. □ 

4. Tightness of (X n ) n >i 

In this section, we will prove Lemma 13.21 At first, we show that the process 
sequence (X n ) n >\ can only have C[0, l]-supported accumulation points. It suffices 
to check that the maximal jump of the process X n (t) converges to zero in probability. 
This can be seen directly from the isotropic delocalization property (|2.6|) . Thus the 
remaining part of this section will be devoted to showing the tightness of the process 
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sequence (X n ) n >i, which is the main part of our proof. To this end, we begin with 
the modulus of continuity of the process X n as 

wxM = w{X n ,8) := sup \X n {t x ) - X n (t 2 )\, < 5 < 1. 

1*1— *al<5 

By Theorem 8.2 of the Billingsley's book [4J, to prove the tightness of (X n ) n >i, it 
suffices to show the following two statements. 

(I) : For each positive rj, there exists an a such that 

P(|X n (0)| >a)<r), n > 1 

and 

(II) : For each positive e and rj, there exists a 8, with < 8 < 1, and an integer 
no such that 

P{w Xn {8) >e) <v, n>n . 

Note that (I) is obvious in our case since 

P(X n (0) = 0) = 1, n > 1. 

Therefore, it suffices to show (II) in the sequel. We will rely on the following lemma. 

Lemma 4.1. Assume that M n satisfies the assumptions imposed in Theorem \1.5[ 
Let e be some sufficiently small but fixed positive constant. If for any t\,t2 £ [0, 1] 
satisfying |t 2 — ti| > n -1 / 2-6 there exists 

E (X n {t 2 ) - X n (t0) 4 < C\t 2 - h\ a (4.1) 

with some positive constants C and a > 1 which are both independent oft\,t2, then 
(II) holds. 

Proof. Note that by definition, we need to show that for any positive e and rj, there 
exists a 5 S (0, 1) and a sufficiently large uq such that for n > uq 

Pi sup \X n (t 2 ) - X n (h)\ >e) < v . 
\]*2-ti|<<5 / 

By the discussions in [1] (see (8.12) of p£]), it suffices to show that for n > uq and 

o < h < l, 



sup \X n (t 2 ) - X n (tx)\ >e/3) <n. 
M<t2<h+S I 



Now we set 

m = m(n) := [n l ' 2+€ ' 2 \ . 

Note that 

< sup \X n {t 2 ) - X n {tx)\ - max |X n (ii + —8) - X n (ti)| 

ti<t 2 <ti+5 0<i<m m 
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< max V (| yi | 2 + -) < Cy^— max(|yJ 2 + -) < C5n~ e/A 

i<j<m n mi n 

;=K*i+^)J 

with overwhelming probability for sufficiently large n. Here in the last inequality 
above we used the isotropic derealization property (|2.6p . Thus it suffices to show 
that for m = \n l / 2+e / 2 \ , 

( max |X n (ii + ^-<5) - X n {h)\ > e/4] < V /2. (4.2) 

By Theorem 12.2 and the proof of Theorem 12.3 of jl], to obtain (|4.2j) . it suffices to 
show that for any t±,t 2 £ [0, 1] such that | £2 ~~ t\\ > one has 

^{X n {t 2 )-X n {t x )f <C\t 2 -t x \ a 

with some positive constants C and a > 1. When n is sufficiently large, by the 
definition of m, it suffices to prove (|4.ip when £2 — ti > n~ 1//2 ~ e . Thus we complete 
the proof. □ 

Below we will verify the condition (|4.ip of Lemma 14.11 for ti,t 2 £ [0, 1] such that 
t 2 — t\ > n _1 / 2_e . At first, we construct a modified process as 

r-r nF n (Ff c 1 (t)) 

V 2 n 

Here we specify F" 1 ^) = -2 and F' 1 ^) = 2. Note that Y n (s),s £ [-2,2] is just 
X n (F n (s)) restricted on [—2,2]. Moreover, 

X n (t) =X n (F sc (F- 1 (t))), t G [0,1]. 

Then by using (|3.2|) again one obtains that with overwhelming probability, 

sup \Y n (F-\t))-X n (t)\ < v^(logn) cl °s 1 °s"max(| 2/4 | 2 + -) 
te[o,i] % 71 

\/n 



Moreover, we also have the definite bounds 

™ 1 _ 

Then by combining (|4.3|) and (|4.4|) . for t 2 — > n~ 1 ' 2 ~ t we have 

E((y„(F~ 1 (i 2 )) - Y n (F-\t x ))) - (X n (t 2 ) - X n ( tl ))f < C(t 2 - tl f. (4.5) 

Therefore, it suffices to show that for any t x ,t 2 £ [0, 1] satisfying t 2 — t\> n _1 / 2_e , 
one has 

E(y n (F- 1 (t 2 )) - Y^F-'ih)) 4 < C(t 2 - h) a (4.6) 
with some positive constants C and a > 1. Now set 

s 1 = F- 1 (ii), ^FTc 1 ^). 
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By the explicit formula of the semicircle law (|2.ip . it is elementary to see that when 
t 2 — t\ > n~ 1 / 2 ~ t , there exists s 2 — s\ > Cn~ l / 2 ~ e for some positive constant C. 
Actually, one has 

C(t 2 - h) <s 2 - Sl < C'(t 2 - ti) 2/3 . (4.7) 

holds uniformly in si, s 2 with some positive constants C and C . Thus it suffices to 
verify that when 

s 2 -s 1 > Cn~ l / 2 - € , 

one has 

E(Y n (s 2 ) - Y n ( Sl )) 4 < (s 2 - Sl ) 2 < C'(t 2 - ti) 4 / 3 . (4.8) 

Then (|4.5p together with (|4.8|) imply (|4.6|) with a = 4/3. Note that, by definition 
we have 

r^-nF n {s) r—— n 1 

^) = Vt E(IwI 2 -^) = VtBIwI 2 -^) 1 {^ ^[- 2 ' 2 ]- 

i=l i=l 

Thus 

l~Bn n 1 

Y n (s 2 ) - Y n {s x ) = J ! y ^(M 2 - -)1{ S1 <A 1 < S2 }, si, «2 € [-2, 2]. 

i=i 



In the sequel, we will show (|4.8p by a Green function comparison strategy. To this 
end, at first, we will approximate the indicator functions l| Sl<Ai<S2 |, i = 1, ••• , n 
by smooth functions expressed in terms of the Green function with the help of the 
following lemma. 

Lemma 4.2. Under the assumptions of Theorem \1.5\ for r] = n~ l / 2 ~ e {s 2 — si) 1//2 
with some sufficiently small but fixed positive constant e, when s 2 — s\ > Cn^ 1 / 2 ^ e 
with some positive constant C , we have 

TOi/i some positive constant C . 

Proof. By the isotropic derealization property (|2.6p and the definite bound |yj| 2 < 1 
we see that it suffices to show for some sufficiently small constant e > 0, 

l 



,r'*> 



E E 



1 

l{si<A 4 <s 2 } 



7T 7 S1 A, - (£ + ^Ir]) 



< C'(s 2 - Sl ) 2 (4.9) 



\i=l 

Now we choose 

9 := n- 1 / 2 -'/ 2 {s 2 - Sl f' 2 » rj > CrT^ . (4.10) 

Observe that both rj and 9 are much less than s 2 — s±. Now we split the real line 
into K = Li U Jj 2 , where 

Li = (-co, si - 9) U (si + 6>, s 2 - <9) U (s 2 + 0, oo), L 2 = M \ Li. 
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We will show that when Aj € Li, one has 



1 



l {s 1 <X i <s 2 } 



S 2 



1 



-dE 



7T J S1 \i-{E + 
To see (j4.11|) . we use the following elementary fact. 



<cy- 



' + t^— r). (4.11) 



1 



7T J S1 Xi-{E + y/^ln) 



-dE 



\ — si Aj-s 2 



vr7 Sl (Aj - £7) 2 + r/ 2 

1 s 2 - Aj si - Aj 

— (arctan arctan 

7T 7/ rj 



Note that when Aj S Li, then one has 

|Aj - s 2 |, |Aj > 6* > rj. 
By the basic asymptotic properties of arctan(x) one has for Aj G Li, 



1 1 

+ 



Ai-si| |Aj - s 2 | 



1 {si<A i <s 2 ) ~ -(arctan — — - arctan — — ) < Crj 

7r 77 77 

with some positive constant C. Let N n (I) be the number of the eigenvalues falling 
into the region I 6 R. Then we have 



E 

t=i 



L {si<Ai<s 2 } 



7T 



1 



1 Xi -(E + irj) 



< 



+ 



si Ai — s 2 



+ iV„(L 2 ) 



(4.12) 



Now we use the so-called local semicircle law (for instance, see Theorem 1.8 of |15j ) 
in the sense that for any interval J Gl with its length |7| > n _1+c for any sufficiently 
small but fixed constant c > 0, 

N n (I) = 0(n\I\) (4.13) 

with overwhelming probability when n is sufficiently large. Now we decompose the 
real line as 



(-00, -5) U ( J Ik) U(5,+oo), 



where K n = 0(n 1 c ) and 

I k = [-5 + (k— l)n~ 1+c , -5 + kn- 1+c \. 
Here we can choose K n such that 

-5 + (K n - l)n- 1+c < 5, -5 + K n n- 1+c > 5. 
We will show that 



E ' 



1 



+ 



1 



Ai - si Aj — s 2 



< Cn log n 



(4.14) 
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with overwhelming probability. At first, by the rigidity property provided in [S] we 
see that all the eigenvalues of M n are in [—5, 5] with overwhelming probability. Thus 
it suffices to show with overwhelming probability, 



E 



1 



+ 



f 



i:AiSLin[-5,5] 



Aj-si Aj-S2 



s E E < 

k=l i:AiGLin/fc 

* C E E 

fc=i r.Xi&Linik 



+ 



1 



' + (k- l)n- 1+c 



< 



^E 



n\h. 



s 9 + (A; - l)n- 1 + c 
< Cn log re. 

Here in the third step we have used f|4. 13j) . Moreover, by (|4.10p and f|4. 13|) we also 
have 



iVn(L 2 ) < Cn9 



(4.15) 



with overwhelming probability for some positive constant C. Combining (|4,12D . 
(|4.14p and ()4.15|) we have 



E 

i=l 



1 {si<X l <s 2 } ~ ~ >* 



7T 



\-{E + y/=l v ) 



< C(rjn log n + n#) (4.16) 



with overwhelming probability. Now by noticing that the left hand side of (|4.16|) is 
bounded by 2n definitely, we also have 



ME 



l {s 1 <\ i <s 2 } 



1 



7T 



«2 



1 



Ai - {E + V^V) 



< C{nn\ogn + n9) 4 . (4.17) 



Then by (|4,17p and the definitions of 6 and r\ we can see (|4.9|) . Thus we complete 
the proof. □ 



Note that 



£(!*■ 



L 1 



n 7r 



Ai - (E + V^lrj) 

- [ 2 9(x*G(z)x - -trG(z))dE 
n J Sl n 



-dE 



(4.18) 



For simplicity, we will use the notation in [llj to write 4 VW = v*Aw for any matrix 
A. Particularly, A vei and A eiV will be simply denoted by 4 vi and 4 iv in the sequel. 

Then with the aid of Lemma 14.21 and ()4.18p , it suffices to prove the following 
lemma. 

Lemma 4.3. Let z = E + \/— lrj with r\ = n~ l l 2 ~ e (s2 — si) 1 ^ 2 , where e is some 
sufficiently small but fixed positive constant. And we assume that s\,S2 G [—2,2] 
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such that s 2 — si > n~ 1 / 2 ~ e . Under the assumptions of Theorem \l.b\ one has 

E [y/n 2 ^{G^{z) ~ ^trG{z))dE^j < C{s 2 - si) 2 . (4.19) 

Proof. Below, we will focus on the real case for simplicity. The proof for the complex 
case is just analogous. Let 

Jn 



be GOE and G{z) be its corresponding Green function. At first, we will show that 
(|4.19p holds for GOE. Note that by Lemma 14.21 we can go back to the original 
quantity to show 

E(Y n (s 2 ) - y„(si)) 4 < C{s 2 - Sl ) 2 . (4.20) 
By (|4.7p it suffices to show that 

E ( v^X)(bi| 2 - ;)l{ sl <A t < S2 }) < C(t 2 - h) 2 . 



n 
i=l 



Note that for the Gaussian case, y is uniformly distributed on S™ . By using (|3.2|) 
and the isotropic derealization property (|2.6p again, we see that it suffices to verify 

(ll/«| 2 --) <C(t 2 -t!) 2 . (4.21) 
\ i=Yntx\ n ) 



Recall the fact 

d gi 



y M, 

IISkII 

Here gig = (g\, ■ ■ ■ ,g n ) T is the n X 1 random vector with i.i.d N(0, 1) coefficients. 
Note that it is elementary to see 

1™ = Ett^t = 0(n~ m ) (4.22) 
||gR|| m 

for any given integer m > 0. Moreover, we have the following lemma. 

Lemma 4.4. Assume that y = (y\, • • • , y n ) T is uniformly distributed on S"" --1 . For 
any i,j,k,l different from each other, 

h(vj-htfk--\ 

n J n n 



nvt - r)(yf - zM - r) 2 = 0(n- 5 ), (4.23) 



n J n n n 

Proof. Note that we always have 



- -)(V] - ~M - -)(Vl --) = 0(n- 6 ). (4.24) 



n 
t=l 
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Therefore, we have 



^2\ 



lb lb I 1/ 

° = E(E( y f-^))(E(^4 ))( ^4 

1=1 j=l fc=l 

= E^ 2 -^-^) 2 + 2 E E ^ 2 ->i-^) 

+ E nyf--)(y]--)(yl--) 2 

i,j,k are all different 

E nyt- l -){y j r --)(yl--? + o{n~% (4.25) 

i,j,k are all dif ferent 

where the last step follows from (|4.22p directly. Now note that y is an exchangeable 
random vector. Thus by symmetry, we see that every term in the summation in 
(I4.25|) is the same, thus (|4.23p follows. (|4.24p can be verified similarly. Note 



n 1 n ^ n 

E(E(y 2 - i ))(E^ 2 - 1 ))(E(y 2 - 






i=l j=l k=l 


1=1 




E^ 2 - i ) 4 + 4E^ 2 : -~) 3 (y] 


-b 










+3^E( y f-i) 2 ( y |-i) 2 
* — ' n J n 












+6 E E(,f-i)( y |- 






i,j,k are all dif ferent 






+ E ^ - > 2 - 






i,j,k,l are all dif ferent 






E E ^ 2 " ~){y 2 3 - 




--)+0(n- 2 
n 



i,j,k,l are all dif ferent 

where the last step follows from (|4.22p and (|4,23p . Now again by symmetry, we can 
get (|4.24p . So we complete the proof. □ 

Now by using K22\i . (j4T23l) and ()4T24"D we have 

4 



E I £ (f? ~ b ) 

\i=|ntij / 



\nt 2 \ \nt 2 \ 

E e(^ 2 - 1 ) 4 + 4 E Hyf--f(y]-- 

i=\nt-i_\ i^j=\nt±\ 
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+3 £ E( y 2 -I) 2 ( y 2 

i^7=|ntij 

[nt 2 J 

+6 £ E(y. 2 

»J,fc=LntiJ 
i,j',fc are aii different 

[nt 2 \ 

+ £ 

i j',fc,i=LntiJ 
i,j,k,l are all different 

< Cn- 2 (t2-h) 2 , 

which implies (|4.2ip for GOE. Moreover, by the discussions above, (|4.19p for the 
Gaussian case follows. 

Therefore, it remains to compare the general case with the Gaussian case. That 
is to say, we only need to show that 

E (J 2 9(G xx (z) - ^trG(z))dE\ - E (J ' 9(G xx (z) - ^trG(z))dE 

< Cn- 2 (s 2 - si) 2 . (4.26) 

To simplify the discussions in the sequel, we truncate the matrix elements of 
M n and M n at n e , where e > is the small constant chosen in Lemma 14.31 By 
Condition 11.41 we see that the truncated matrices coincide with the original ones 
with overwhelming probabilities. Thus their corresponding Green functions also 
equal to the original ones with overwhelming probabilities. Following from this fact 
and the definite bounds 

||G(z)|| op ,||G(z)|| op <7r\ 

we see it suffices to prove (|4.26|) for the truncated matrices. Therefore, below we 
will make the additional assumption of 

max|wj,| < n e , max|vj,-| < n € . 
ij ' ij 

Note that the truncation may change the first four moments of the original elements 
by tiny amounts. Actually, such minor changes are smaller than n~ K with any 
positive constant K when n is sufficiently large. It will be clear that such small 
changes on the moments of elements do not affect our comparison procedure. So for 
simplicity, we will still regard that the two truncated matrices matches to the first 
four moments. Moreover, note that all the results needed from the references such 
as [9] and [11] hold with overwhelming probabilities for the original matrices while 
the truncated matrices equal to the original ones with overwhelming probabilities, 
thus the results from these references are still valid for the truncated matrices. 

The main idea to show (|4.26p is a Green function comparison strategy based on 
the discussions in [TT] . To pursue this approach, we need to introduce some notation. 



ii 



-)(y 2 3 --)(yl--) 2 

n J n n 



n 



■)(»? 



n 
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At first we assign a bijective ordering map <p on the index set of the matrix elements, 
<t>:{(i,j):l<i<j<n}^{l,--- , y ^ ' }. 

For 1 < 7 < n(n + l)/2, we define the matrix Mn to be the Wigner matrix with 
its element being Vijj^fn if <p(i,j) < 7 or Vij/^Jn otherwise. Correspondingly, 
we denote the Green function of Mn by G 1 {z). Thus it suffices to estimate the one 
step difference 

E ^jf 2 %Gl^{z) - hrG~ ( {z))dE^ - E Qf * ^(G^ 1 (z) - hrG^ 1 (z))dE^J 

and in the end we will use a telescoping argument to sum up all these one step 
differences to obtain (|4.26p . 

Now without loss of generality, we assume that 7 = <fi(a, b). Thus Mn and Mn 1 
only differ in the (a, b) and (6, a)-th elements. Let 

& — e a^b ■ 

Then we can write 

M^ = Q + n~ 1/2 y, M^ 1 = Q + n-^V, 

where 

V = (l-5 ab /2)(v ab E ab + v ba E ba ), 

V = (1 - 8 ab /2)(v ab E ab + v ba E ba ), 
and then Q is a random matrix independent of v ab and v ab . Let 

R{z) := (Q-z)- 1 . 

Moreover, for simplicity, we rewrite G 7_1 (z) and G 1 {z) as S(z) and T(z) respec- 
tively. And when there is no confusion, we will omit the variable z from the above 
notation. Now by the resolvent expansion, one has 

4 

S = R + n- k l 2 {-RV) k R + n- 5 / 2 (-RV) r °S. (4.27) 

k=l 

Then we can write 



E 2 9(5 xx (z) - ra-4rS(z))d£ N 

E(£ 2 9((i? xx (z)-n-Vi?(,)) 
4 

+ E ^^ /2 ([(- J R(^)^) fc «(^)]xx - n~Hr(-R(z)V) k R(z)) 
k=l 

+ n- 5 / 2 ([(-R(z)VfS(z)U ~ n-Hr(-R(z)VfS(z))^ dE^j 
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4 



E(9f(Jb + E n ~ k/2jr x + ^" 5/2 ^)) 4 . 



fe=l 



Here Ti 



Ti(a, b), i = • • • ,5 whose definitions are given by 



2 

(i? xx (z) - n~HrR{z))dE, 



f\[(-R(z)V) k R(z)}^ - n-Hr(-R(z)V) k R(z))dE, k = !,-■■ ,4, 



Jsi 
/•S2 



F 5 



/ ([(-i?(z)y) fc 5(z)] xx - n-Hr{-R{z)V) k S{z))dE. 



Observe that in the real case, every \{RV) k R] xx can be written as a summation 
of the terms in the form of 



where q fej0j6 (i?,x) is some product of the factors R xa , Rax, Rxb, Rbx, R aa , Rab, 
Rb a and R^. We remind here in the complex case, (f a f>) fc should be replaced by 
{v a b) kl {vba) k2 with k\ + /c2 = k. Moreover, the total number of the factors R xa , Rax, 
Rxb, Rbx m every qfc i0> &(i2, x) is 2. For example, 

\{RV) 2 i?] xx = (^ab) 2 {RxaRbbRax + RxbRabRax + RxaRbaRbx + RxbRaaRbx) ■ 

In the following Lemma 14.51 we will give some pre-bounds for the quantities T^. 
These bounds will be used to provide a pre-bound of 



through a comparison procedure. Then the pre-bound for (|4.28|) will imply an 
improved bound for Tq. Such an improved bound combined with another round of 
comparison can help us to obtain a good bound for (|4.28|) . In other words, our main 
route in the sequel is to use a "bootstrap" strategy to get a pre-bound of (|4.28|) at 
first and then use the pre-bound to get the final bound in Lemma 14.31 For ease of 
presentation, we will use the notation in [11] to set 



(v a b) <ik,a,b 



(i?,x) 




(4.28) 




Lemma 4.5. Under the assumptions in Lemma \4-3[ one has with overwhelming 
probability 



Mz)\<n°^ sup ^(z)(s 2 -s 1 ), (4.29) 





sup 

Ee[s 1 , S2 } 




(4.30) 
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Proof. First of all, by truncation, we have that the elements are bounded by rf . 
Moreover, by assumption one has 

2n- 1 / 2 " £ > 7/ > n~ 3 / 4 - 2e (4.31) 

for some sufficiently small e > 0. Thus we always have z = E + \J — 1-q £ S which is 
defined in (|2.3p . Now we come to verify (|4.29p . By definition, 

rs 2 



?0= [ \R™{z) - n- l trR{z))dE. 



Observe that 

n e > (logn) loglogn 

for sufficiently large n. Using the isotropic local semicircle law (|2.5p . one has with 
overwhelming probability 

\S xx {z) - m sc {z)\ < n°®W(z), \n~ l trS{z) - m sc {z)\ < n°^^(z). 

(4.32) 

Moreover, by (3.28), (3.29) of [11] and the discussions above them we know 



\S xa (z)\,\R xa (z)\ <n° 

and 



^ (\/^^ + * (z) + |Xa| ) (4 ' 33) 



1^(^)1,1^^)1=0(1) (4.34) 

with overwhelming probability. Moreover, analogously, the bound in (|4.33p also 
holds for S a x.(z), Ra*(z) with overwhelming probability. Then by ()4.27p . (I4.32p ~ 
(|4.34p one can get that 

\RxJjc) - m sc (z)\ < n° W tf (*), \n~ l trR{z) - m sc {z)\ < n° {t) ^{z). 

Thus ()4.29p follows immediately. 

Now we come to verify (|4.30p . Note that for 1 < k < 5, the total number 
of the factors R xa , R xb , R ax , R bx , S xa ,S xb ,S ax and S bx in each [(-RV) k R] xx or 
[(— RV) k S] xx is 2. Now let p be the total number of the factors R xa , Rax, S xa and 
S ax , and q be the total number of the factors R xb , R bx , S xb and S bx . Thus p + q = 2. 
Then by iHf and (Pi)) one obtains 



\T k \ < n°W sup Y. (\/^ EE ^ + *W + W) P 



5^) + ^) + | X6 | )?(s2 _ Sl) 



< n O(e) sup ( /^M + ^ ) + K |)2 (s2 _ Sl) 
BE[*i,«] V nT l 



2.--S 



Ee[ n ,sa] V 

< n O(e) gup (f^M^ + ^ {z) \ +{lxal 2 + lxbl 2 ) \ {s2 _ Si) 

Ee[ si ,s 2 )\\ nr) ) ) 

with overwhelming probability. Thus we conclude the proof of Lemma 14.51 □ 
Following from Lemma 14.51 and the definite bound 

l-^ol < V" 1 ^ - 

one can see that 



E(^ ) 4 <n°W sup ((J^M + l.)) ( S2 - Sl f<^)( S2 - Sl f. 

Here we used the definition of n in Lemma 14.31 However, relying on Lemma 14,5} we 
can provide a better bound on the 4-th moment of QJ-q by a "bootstrap" strategy. 
Precisely, we will show the following bound, 

E(^o) 4 < n- 3 / 2+ °^(s 2 - Sl ) 2 . (4.35) 

To verify (|4.35|) . we need the following pre-bound for (|4.28|) for any Wigner matrix 
M n satisfying the assumptions of Theorem 11.51 

Lemma 4.6. Under the assumptions of Lemma\4-3\ we have 



E ^J*($S(G„{z) - hrG{z)))dE^ < n - 3 / 2+0 ^(s 2 - Sl ) 2 . 

Before proving Lemma (|4.6p . we explain here how it implies (|4.35p . Note that by 
definition, 

3fJb = I 2 Q(R^(z) - n -HrR(z))dE. 

J si 

By (|4.27p . it suffices to show 

E^J* 2 %(S xx {z)-n-HrS(z))dE^ < n - 3 l 2+0( - e \s 2 - si) 2 . (4.36) 

Note Mn is also a Wigner matrix satisfying the asumptions in Theorem 11.51 Thus 
(|4.36p follows immediately. So does (|4.35p . Now we come to prove Lemma 14.61 



Proof of Lemma \4-6j Now we will use the mentioned strategy to estimate the one 
step difference 

E QT^CGJJz) - hr(P(z)))dE^ -k(J*(*(G£Hz) - UrG^ 1 {z)))dE^ 
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by inserting the pre-bounds of J-q and T\» provided in Lemma 14.51 at first, and then 
using the telescoping argument we will obtain the bound in Lemma 14.61 By the 
discussions above, we have 



£ VxxOz) - n-HrS{z))dE^j 

4 



l 



k=l 

5 4 

= A ab + E Y n ~ T!Llki/2l {Zt =1 k i >B}Il*F'H> ( 4 - 37 ) 

fei,— ,fc4=0 i=l 

where A a b is a quantity only depending on the first four moments of v a b and inde- 
pendent of v a b itself. Analogously, 

l 



j s ' V«(*) - rCHrT(z))dE^ 



5 4 
= Aab + E Y, n ~^ =lki/2l {Zt =1 k i >5}I[ C 3T ki ( 1 -l^ 1 ), (4.38) 

fci,— ,k^=0 i=l 

where — 1 — > 7) stands for the quantity defined through replacing V and S 1 
by V and T respectively in the definition of J-^. Therefore, to get the one step 
difference one only needs to estimate the second terms of (|4.37p and (|4,38p . We will 
only handle (|4.37p below. ()4.38p is just the same. 
Note that it suffices to estimate the contribution of 

En-/ 2 Y hzu^yll^- ( 4 - 39 ) 

ki,— ,fe4=0 i=l 

for 5 < k < 20. Now we denote the right hand sides of (|4.29p and (|4.3U|) by D\ and 
D 2 respectively. Note that by the assumption on rj, 

D l <n- 1 / A+ °^{s 2 -s l fl\ 

and 

D 2 < n' 1 ' 2+0 ^(s 2 - si) 1 / 2 + n°^{\x a \ 2 + \x b \ 2 )(s 2 - s x ). (4.40) 
with overwhelming probability. Combining with the definite bounds 

||i2(«)||op, HSWIIop^^" 1 , (4-41) 

we can use the upper bound in (|4.40p as a definite one when we take expectations 
towards the polynomials in D 2 . Now let m be the number of in the collection 
{ki, k 2 , ks, k^}. Then we have 

fcl,---,fc4=0 1=1 
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< Cn~ K/2 D ? ED t m 

m=0 

< Cn*l\s 2 - Sl ) 2 (n- 1/4+ ° (e) ) m (n-V** ® + n°^(\x a \ 2 + \x b \ 2 ) 



4—m 



m=0 

< „-«/2( ga _ Sl )2 £ ^ n -m/4+0(e)^ ^- ( 4-m)/2+0(e) + ^(^p + 
m=0 

Here we have used the fact that \x a \ < 1. Then it is not difficult to see 

i$> k/2e e ^^n^i 

a, 6 fci ,fc4=0 i=l 

< Y, n ~ R ' 2 (n~ 5/4+0(e) + n°W(|x a | 2 + \x b \ 2 )) (s 2 - Sl ) 2 

< n- K / 2+1+ °W( S2 - Sl ) 2 . (4.42) 

Here we have used the fact that J2 a \ x a\ 2 = 1- Then by (|4.37p . (|4.42p and the fact 
k > 5 one has 

|E (J* 2 (%G xx (z) - UrG{z)))dE\ - E (J" (Q(G„(z) - ^trG(z)))dE^j \ 

<n-^ 2+ °^{s 2 - Sl ) 2 . 

Using Lemma 14.31 for the Gaussian case which has been proved above, we can con- 
clude the proof. □ 

Now we proceed to the proof of Lemma f4.3l Again we will resort to the telescoping 
argument. But now we will use (|4.35|> instead of the pre-bound (|4.29|> . We go back 
to (|4.37]) and provide a better bound for (|4.39|) below. Note that by (|4.42|) . it suffices 
to consider the cases of k = 5 and n = 6. 

At first, we come to deal with the case of k = 5. To this end, we split the set 
{{ki, ■ ■ ■ , k^} : J2t=i hi = 5} into the following four cases. 

(i) : {k 1 ,k 2 ,k 3 ,k 4 } = {0,0,0,5}, 

(ii) : {k u k 2 , k 3 , k A } = {0, 0, 1, 4}, {0, 0, 2, 3}, 

(iii) : {h, k 2 , k 3 , k 4 } = {0, 1, 1, 3}, (0, 1, 2, 2), 

(iv) : {h,k 2 ,k 3 ,k 4 } = {1,1,1,2}. 

Note that for case (i) , by using (|4.41 j) , (|4.35p and Lemma 14.51 we see that 



-5/2 ■ 

a.b 
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^ -- 5/2 En( E (^) 4 ) i/4 

a, b i=l 

< n-"/ 2 n- 9 / 8+ °^) (n-Va+OW + n°^(\x a \ 2 + |x fe | 2 )) (s 2 - Sl ) 2 

a,b 

Now we come to evaluate the case (ii). 
0,6 

< n- 5 / 2 J] n- 3 / 4 +°^) (n-V^+OW + n°W(|x a | 2 + |x b | 2 )) * (* 2 - Sl ) 2 

< J n -HO( e ) +n O( £ ) ( | Xa |2 + | Xfc |2^ _ si) 2 
a,b 

= n - 9 / 4+ °W( S2 - Sl ) 2 . 

Thus both of these two cases can be bounded well by our estimates in (|4.30p and 
()4.35p . Now we come to deal with the cases (iii) and (iv) whose estimates need 
more accurate bounds on the products of GJ-fc. This relies on the observation that 
with overwhelming probability, 

|-7=k| < n OW sup J + ¥(*) + \x a \)J + ¥(*) + |x 6 |)(* 2 - *i) 
Ee[ Sl ,s 2 ] V n? ? y nry 

< ( n -V2+0(e) + n -l/4+0(e) (|Xa| + + ^(^^ fa _ fll) l/2. (4.43) 

Note that (|4.43|) is a result of fact that p = q = 1 for [— i?Vi?]xx- Thus when 
{k\, k 2 , ks, ki} = {0, 1, 1, 3}, one has 

n -5/2 £ |E9Jo(^i) 2 9^3| 

a. 6 

< ( n -l/2+0( £ ) +n -l/4 + 0(e) ( | Xa | + ) + |x a | |x b |) ' 
a, 6 

x (n-W) + n °W(|x a | 2 + |x b | 2 )) (s 2 - Sl ) 2 

< n- 5 / 2 ^ n- 3 / 8 +°W (n- 1+ °( f > + n" 1 /^)^^ + |x b | 2 ) + n ^|x a | 2 |x b | 2 ) 

a,b 

x (n-W«) + n°U(\x a \ 2 + |x b | 2 )) ( S2 - Sl ) 2 

< n- 19 / 8+0 ^( S2 - Sl ) 2 . 

For {k\, k 2 , ks, k{\ = {0, 1, 2, 2}, one has 

„-5/2 |E9Jo^l(^2) 2 | 
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< re -V2^ n -3/8+0 W (n-W(e) +n -l/4+0(e) ( | a . a | + | Xfe | ) + | Xa 1 1 Xb | ) 

a, 6 

x (n-WW + n °W(|x a | 2 + |x b | 2 )) 2 ( S2 - Sl ) 2 

< n- 5 / 2 ^ n- 3 / 8 +°^) (n-Va+OW + n-^+O^d^l + |x b |) + n°( £ >|x ||x 6 |) 

a, 6 

x(n- 1+0 W+n°W(|x a | 4 + |x fe | 4 )) ( S2 - Sl ) 2 

< (V 5 / 2 ^n- 3 / 8+0 W|x a ||x 6 |(n- 1+0 W + |x a | 4 + |x fc | 4 ) + n' 17 / 8+ °^A (s 2 - Sl ) 2 . 

^ a,b ' 

Note that ^ a \x a \ = 0{y/n). Thus 

^2\x a \\x b \=0(n), ^|x a | 5 |x fe | = 0{y/n). 

a,b a,b 

Thus we have 

n" 5 / 2 m^Ti^fl < n- 17 / 8+ °^( S2 - Sl ) 2 . 

a,b 

The estimation towards case (iv) is similar. We do it as follows. 

n - 5 / 2 ^|E(3f^i) 3 3f^ 2 | 

a,b 

< n- 5 /^( n -l/2 + 0( £ ) +n -l/4 + 0(e) (|la| + \ Xb \) +n O(e)\ Xa \\ Xb \y 

a,b 

x( n - 1 ^^ + n°^(\x a \ 2 + \x b \ 2 )) (s 2 - Sl f 

< n- 5 / 2 (n- 3/2+OW + n- 3 / 4 +°^(|x a | 3 + |x fe | 3 ) + n°^\x a \ 3 \x b \ 3 ) 

a,b 

x (n^W +n°W(K| 2 + |x 6 | 2 )) (, 2 - Sl ) 2 

< (n- 5 / 2 ^ |3 a | 3 |a%| 8 (rr 1/SHO(e) + n oW(| Xa |2 + |xfc|2)) + n -9/4+0(e)\ ^ _ Bi)a 

^ a,b ' 

< n -9/4+0(e) (s2 _ Si)2> 

In summary, we have 

lEn- 5 /^^ = n- 47 /8 + 0(e) (s2 _ si) 2_ (444) 

i=l 

Now we come to deal with the case of k = 6. When there is at least one ki equal 
to 0, it will be easy to check that the contributions of such terms are negligible by 
using (|4.30p and (|4.35|) . We leave the details to the readers. Now we still have the 
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case where there is no in {k\, k 2 , k$, k^}. Since k = 6, we have 

{h,k 2 , k 3 , k 4 } = {1, 1, 2, 2} or {1, 1, 1,3}. 

The discussions for these two cases are quite similar to that of (iv) for n = 5. 
Actually, we can get 

4 

|En- 3 l {Etifci=6} n^|=n- 5 / 2+ °«( S2 - Sl ) 2 . (4.45) 

i=l 

Then by (|4.42j) . (|4.44j) and (|4.45p we can complete the comparison procedure. Thus 
(|4T26|) follows. So does Lemma PI □ 
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